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ABSTRACT 

We present rest-frame optical spectra for a sample of 9 low-mass star-forming galaxies in the redsliift 
range 1.5 < z < 3 which are gravitationally lensed by foreground clusters. We used Triplespec, an 
echelle spectrograph at the Palomar 200-inch telescope that is very effective for this purpose as it 
samples the entire near-infrared spectrum simultaneously. By measuring the flux of nebular emission 
lines we derive gas phase metallicities and star formation rates, and by fitting the optical to infrared 
spectral energy distributions we obtain stellar masses. Taking advantage of the high magnification 
due to strong lensing we are able to probe the physical properties of galaxies with stellar masses in the 
range 7.8 < log M/Mq < 9.4 whose star formation rates are similar to those of typical star-forming 
galaxies in the local universe. We compare our results with the locally determined relation between 
stellar mass, gas metallicity and star formation rate. Our data are in excellent agreement with this 
relation, with an average offset (Alog(0/H)) = 0.02 ± 0.08, suggesting a universal relationship. 
Remarkably, the scatter around the fundamental metallicity relation is only 0.24 dex, smaller than 
that observed locally at the same stellar masses, which may provide an important additional constraint 
for galaxy evolution models. 

Subject headings: galaxies: abundances - galaxies: evolution - galaxies: high-redshift - gravitational 
lensing: strong 



1. INTRODUCTION 

The gas-phase metallicity represents a fundamental 
property of galaxies and can be used to investigate the 
complex physical processes that govern galaxy evolution. 
It mainly traces the star formation history, as metals pro- 
duced in stars are ejected into the interstellar medium 
(ISM) , but the exchange of material between the galaxy 
and the intergalactic medium (IGM) also plays an im- 
portant role. The accretion of metal-poor gas from the 
IGM can dilute the metal content of the gas in a galaxy. 
Also, stellar winds can substantially lower the metallicity 
by ejecting metals. 

Despite the complexity of these processes, a clear rela- 
tion between galaxy lu minosity and metallici ty has been 
known since the work of lLequeux et al.l (|1979f ) . Recently, 
thanks to the vast amount of spectroscopic and photo- 
metric data available from the Sloan Digital Sky Survey 
(SDSS), it has become clear that the physical parame- 
ter that correlates most strongly with me tallicity is the 
galaxy stellar mass (jTremonti et al.l 120041 ) . This mass- 
metallicity relation, in which galaxies of higher masses 
contain larger metallicities, is remarkably tight over 3 
orders of magnitude in stellar mass, with a dispersion of 
only 0.10 dex in metallicity. 

A natural explanation for the observed mass- 
metallicity relation is the outflow of metal-enriched gas 
driven by star formation. Because of the lower gravi- 
tational potential, low-mass galaxies lose a higher frac- 
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tion of their gas, with a consequent decrease in metallic - 
ity (|Larsod fl97l iGarnettl [200l iTremonti et al.|[200l . 
An alternative possibility is that lower mass galaxies 
are less metal-rich because their st ar formation histor y 
has been developed more gradually ([Ellison et al.ll2008h . 
in agreement with the now-familiar effect of downsizing 
(|Cowie et al.lll996l) . 

Different models of galaxy formation and evolution are 
able to match the mass-metallicity relation in the local 
universe, but have dissimilar predictions for high redshift 
galax i es fe.g.lDe Lucia et al.ll2004l:lDave fc Qppenheimer l 
| 2007t iTassis et al.ll2008t iDave et all l2011al lYates et al.l 
2012). Observing the redshift evolution of the mass- 
metallicity relation can therefore differentiate these mod- 
els. Observations at different redshifts have shown a 
clear evolution with cosmic time, w ith lower metallicity 
at higher redshift , for a fixed mass (iSayariio et al.1120051: 
Erb et al.l l2006at jMaiolino et al.l 120081: iMannucci et al.l 
20091: iZahid et alj|20lH lYuan et al.ll20I3D . 

However, it is important to recognize that high-redshift 
studies target galaxy populations that are different from 
those found typically in the local Universe. The evolution 
of the mass-metallicity relation could then be the result 
of a selection effect rather than a change in the physical 
properties of the galaxies with cosmic time. 

Among the differences between local and high-redshift 
galaxies, star formation activity is one of the most im- 
portant. At earlier cosmic times, the star formation rate 
(SFR) was on average much higher than today, because 
galaxies contained a larger amount of cold gas. Also, 
most of the high-redshift surveys are magnitude-limited 
in the rest-frame UV, and therefore tend to select galax- 
ies with high SFR. The combination of these two ef- 
fects makes it very difficult to compare the metallicity of 
galaxies at different redshifts with the same stellar mass 
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and star formation rate. It is then essential to study the 
relation between SFR and metallicity, since this could 
have important consequences on the interpretation of the 
observed e volution of the mass-m etallicity relation. 

In fact, iMannucci et al.l (|2010( ) found that the local 
mass-metallicity relation is different for samples of galax- 
ies with different star formation rates. Furthermore, they 
showed that the SDSS galaxies lie on a tight 3D surface 
in the mass-metallicity-SFR space, wi th a dispersion of 
only 0.053 dex in metallicity (see also lLara-Lopez et al.l 
120101 ). According to this fundamental metallicity rela- 
tion (FMR), at fixed stellar mass SFR and metallicity 
are anti-correlated. If this relation holds independently 
of cosmic time, a galaxy population at high redshift will 
tend to have a low average metallicity because of its 
high SFR. In this scenario the evolution of the mass- 
metallicity relation is driven by the shifting of galaxy 
populations on the SFR-mass plane, rather than being 
directly caused by the evolution of some physical pro- 
cess. Clearly, testing the fundamental metallicity rela- 
tion at different redshifts is of primary importance. 

The redshift evolutio n of the FMR was first explored by 
IMannucci et al.l (|2010fl using samples from the literature, 
and they concluded that the local relation is a good fit for 
any star- forming galaxy up to z ~ 2.2. But high- redshift 
observations are biased towards high- SFR galaxies, and 
a direct test using galaxies with the same range of star 
formation rates that is seen in the SDSS sample (SFR < 
10 M Q /yr) is still lacking. Additional difficulties come 
from the fact that to measure the metallicity one needs 
the rest-frame optical emission lines, which at z ~ 2 are 
redshifted into the near-infrared, a spectral region where 
sky emission is strong. 

One way to probe lower star formation rates with the 
current technology is to take advantage of strong gravita- 
tional lensing. The magnification induced by foreground 
galaxy clusters allows one to reach faint objects, corre- 
sponding to stellar masses and SFRs (on average) lower 
than the values achievable without lensing. Recent stud- 
ies of the FMR for high-redshift lensed galaxies found a 
general agreement wit h the local relation, a lthough with 
a very large scatter (iRichard et afl 120111 : iWuyts et al.1 
I2012M IChristensen et al.H2012ft . 

Testing the universality of the fundamental metal- 
licity relation is important not only for understanding 
the evolution of the mass-metallicity relation, but also 
to constrain mode ls of galaxy evolution. For example, 
IDave et"al~l (|2011aD consider a simple model in which in- 
flow, outflow and star formation are in equilibrium and 
determine the gas metallicity. This scenario can quali- 
tatively explain the dependence of the mass-metallicity 
relation on the star formation rate: at a fixed stellar mass 
a high SFR is caused by a large inflow, that in turn im- 
plies a low metallicity. If a galaxy is perturbed, e.g. by a 
merger, it will move away from the FMR and, after some 
time, will return to the equilibrium configuration. This 
equilibrium timescale determines the scatter in the mass- 
metallicity-SFR relation. So long as the yield of metals 
per unit star formation and the mass loading factor (i.e. 
the ratio between outflow and star formation) are con- 
stant, the equilibrium relation implies a universal FMR 
independent of reds hift. Although this and other sim- 
ple analytic models (|Daval et alj|2012t IDave et al.ll2012ft 
succeed in explaining the local fundamental metallicity 



relation, we are still far from a detailed understanding of 
the relevant physical processes. High-redshift observa- 
tions are essential for quantitative tests of hydrodynam- 
ical simulations of galaxy evolution. 

In this work we study a sample of low-luminosity, z ~ 2 
gravitational arcs with magnification factors of ~10-100. 
We used the Triplespec spectrograph on the Palomar 
200-inch telescope that features a good sensitivity and 
covers the full near-infrared wavelength range. This char- 
acteristic makes it an ideal instrument for such a study, 
because it allows us to observe all diagnostic lines of in- 
terest simultaneously. In addition to the efficiency of ob- 
servation, a particular benefit is the ability to measure 
emission line ratios, the diagnostics of gas-phase metallic- 
ities, in a single exposure, mitigating uncertainties that 
arise from variations in weather conditions. Also, we 
mainly rely on the emission lines [OIIIJA5007 and H/3 for 
measuring the metallicity, and not on the [Nil] A6584/Ha 
ratio, so that we are not limited by the requirement of 
detecting the faint [NII]A6584 line. For these reasons we 
probe stellar masses and star formation rates that are on 
average lower than the ones of previously studied samples 
of lensed galaxies. 

The sample of gravitational arcs, the spectroscopic ob- 
servations, and data reduction are described in Section 2. 
In Section 3 we present the photometric measurements 
and the fitting of the spectral energy distribution, while 
in Section 4 we explore the galaxy physical properties us- 
ing the measured line fluxes. We discuss the constraints 
of these measurements on the evolution of the fundamen- 
tal metallicity relation in Section 5, and the implications 
for galaxy evolution models in Section 6. We assume a 
A cold dark matter (ACDM) cosmology with = 0.7, 
O m = 0.3 and Hq — 70 km s~ 1 Mpc~ 1 . Magnitudes are 
given in the AB system. 

2. DATA 
2.1. Sample Selection 

We selected our sample of gravitational arcs from the 
literature according to the following three criteria. 

First, we considered only galaxy clusters with a well- 
constrained lens model. This allowed us to select arcs of 
known magnification fj, > 10. 

Second, the observed (i.e. not corrected for lensing) arc 
magnitude must be R < 23, so that observations with 
Triplespec at Palomar are feasible. This means that we 
can probe intrinsic magnitudes R > 25.5, fainter than 
the limits of typical non-lensing surveys. 

Third, the arc should have a known spectroscopic red- 
shift such that the emission lines from [Oil] to Ha fall 
in the wavelength range observable with Triplespec. The 
ideal range is 2 < z < 2.5, but we can measure metal- 
licities and star formation rates using lines available for 
galaxies from z~1.5toz~3. 

This selection provides us with 10 sources viewed 
through 10 distinct clusters (see Table []}. For the arc 
in RXJ1720 only Ha is observable because the other di- 
agnostic lines are unfortunately obscured by night sky 
emission. We do not attempt any analysis on this ob- 
ject, but we include it in Table Q] because ours is the 
first redshift measurement for this arc obtained from a 
rest-frame optical emission line. 

From now on we will refer to the gravitational arcs 
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TABLE 1 
Sample of Gravitational Arcs 



Cluster 


Arc 




Reference 


Run 


Exp. Time 


Redshift' 


A611 


1.2, 1.3 


19.6 ± 3.0 


Newman et al. (2009) c 


B 


lh20min 


1.4902 


RXJ2129 


1.1, 1.2 


61 ± 17 


Richard et al. (2010) 


A 


5h30min 


1.5221 


A1413 


2.1, 2.2 


23.9 ± 6.4 


Richard et al. (201oj 


B, D 


3h00min 


2.0376 


A1835 


7.1 


88 ± 30 


Richard et al. (2010) 


B 


lh45min 


2.0733 


RXJ1720 


1.1, 1.2 


22.5 ± 9.1 


Richard et al. (2010) 


A 


4h00min 


2.2200 


A773 


1.1, 1.2 


27.9 ± 7.9 


Richard et al. (2010) 


C 


3hl5min 


2.3032 


MACS0717 


13.1 


7.2 ± 3.0 


Limousinetal. (2011) 


C, D 


5h30min 


2.5515 


A383 


3C, 4C 


23.9 ± 3.3 


Newman et al. (2011) 
Coe et al. (*20Toi 


C 


2h00min 


2.5771 


A1689 


1.1, 1.2 


57 ± 23 


D 


6h30min 


3.0421 


A1703 


3.1, 3.2 


46 ± 20 


Richard et aH72009) 


B 


3h30min 


3.2847 



a Gravitational magnification. For multiply imaged sources, this is the sum of the magnifica- 
tions. 

b Redshifts measured from [OIII]A5007, except for RXJ1720, for which Ha was used. The 
uncertainties are always less than 0.0003. 

c Magnification factor calculated after updating the lensing map with the new arc redshift, see 
Section g] 



TABLE 2 

Observing runs 



Run 


Date 


Seeing 


(arcsec) 


A 


2010 August 23, 25, 26 


0.9 


- 1.3 


B 


2011 April 10, 11, 12, 13 


0.9 


- 1.3 


C 


2012 January 12, 13. 14 


1.1 


- 2.0 


D 


2012 April 29, 30, May 1 


0.9 


- 1.5 



by the names of the corresponding galaxy clusters, e.g. 
A1835 for A1835 arc 7.1. 

2.2. Spectroscopy 

All spectroscopic data were taken with Triplespec on 
the 200-inch Hale Telescope at Palomar Observatory over 
the course of four observing runs (see Table ^ . Triple- 
spec is a near-infrared cross-dispersed spectrograph that 
simultaneously covers the wavelength range 1-2.4 /zm 
with a resolution R ~ 2700 (jHerter et al.ll2008l ). 

The 1 x 30 arcsec long slit was positioned on the tar- 
gets as shown in Fig. [T]via a blind offset from a bright 
star. For each target we typically undertook many expo- 
sures of 300-450 seconds each, using a two-point dither- 
ing pattern. The position of the target along the slit 
and the dithering offset were carefully chosen for each 
arc, avoiding any overlap of the arc with foreground clus- 
ter galaxies, and leaving enough blank sky along the slit 
to reliably measure and subtract background emission. 
When it was possible to arrange a multiply imaged sys- 
tem in the slit, the spectra, if resolved, were reduced and 
extracted separately and then combined. 

The spectroscopic d ata were reduced using a modi- 
fied v ersion of Spextool (jCushing et al.ll2b04t IVacca et al.l 
2004). For each target we extracted the spectrum from 
each A— B pair and then combined the ID spectra. The 
extraction aperture was defined using the [OIII] emis- 
sion profile in the stack of many A— B pairs, since the 
line emission is rarely detected in single frames. 

Flux calibration a nd correction for t elluric absorption 
were performed using lElias et al.l (|1982j ) A-type standard 
stars. Although absolute flux calibration can be very 
uncertain, it affects only the emission line fluxes but 
not their ratios, which are used in calculating gas-phase 
metallicities. This is one of the main advantages of using 
Triplespec, which allows one to observe the entire near- 



infrared spectrum at once. However, SFR measurements 
are affected by absolute flux calibration, that therefore 
needs to be carefully quantified. 

The accuracy of the absolute flux calibration is limited 
by many factors. First, the observing conditions were 
not always photometric, and a good fraction of our ob- 
servations were affected by the presence of clouds. This 
problem is mitigated by the fact that each target was 
observed on more than one night, and we scaled the dif- 
ferent frames to match the line fluxes observed during 
the best conditions. The second issue arises from the 
fact that every few minutes the pointing was changed 
according to the dithering pattern. As a result, the slit 
alignment is not identical in each frame, and the ob- 
served flux changes as a function of the centering of the 
target. We tested the significance of this effect using the 
standard star observations, where the target is bright 
enough to compare the flux in different frames. The dis- 
crepancy in the absolute flux between different frames is 
typically less than 50%. This represents an upper limit 
on the flux uncertainty, since this random effect is at- 
tenuated by averaging together multiple frames for each 
standard star. Also, the science observations were made 
on much longer timescales, and the guiding was overall 
very stable, as we could check from the guider images 
taken during the exposure. Another possible source of 
uncertainty is the highly variable seeing. However, the 
difference in seeing (which was almost always larger than 
the slit width) between the science target and the stan- 
dard star observations has a much smaller effect than 
the slit misalignment. Since it is difficult to quantify the 
real uncertainty, we conservatively use the value of 50% 
as fractional error in the flux calibration. This corre- 
sponds to a contribution of 0.22 dex to the uncertainty 
in the SFR. 

The calibrated spectra together with their l-er error 
are shown in Fig. [T] The emission line [OIIIJA5007 is 
well-detected in each spectrum; other observed emission 
lines are Ha, H/3, [OIH1A4959, and [OII]A3727, 3729. The 
fainter lines [NH1A6584 and [NeIII]A3870 are detected 
only in a few cases. The continuum emission from the 
arcs is never detected, but the residuals from sky subtrac- 
tion or the emission from foreground galaxies can cause 
the observed continuum to be different from zero. 
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A611 (z - 1.4902) 




1.21 1.22 1.23 1.24 1.25 1.63 1.64 

Observed Wavelength (|im) 




1-10* 2-1 4 3-1 4 4>10 4 5*1 4 

Observed Wavelength (angstrom) 



RXJ2129 (z = 1.5222) 




1.22 1.23 1.24 1.25 1.26 1.27 1.65 1.66 
Observed Wavelength (prn) 



MO* 2'10 4 3-1 4 4>10 4 5*1 4 

Observed Wavelength (angstrom) 



A1413 (z- 2.0376) 




A1413 



1.13 1.141.15 1.161.171.18 1.48 1.49 1.50 1.51 1.52 1.99 2.00 2.01 
Observed Wavelength (jim) 



1-10* 2-10* 3-1 4 4>10 4 5*1 4 

Observed Wavelength (angstrom) 



A1835 (z- 2.0733) 




1.14 1.16 1.18 1.20 1.50 1.52 1.54 
Observed Wavelength (jim) 




1-10 4 2-10" 3-10* 4-10 4 5-10" 

Observed Wavelength (angstrom) 



Fig. 1. — HST image stamp, Triplespec spectrum, and SED fit for each gravitational arc. Left: The Hubble Space Telescope images 
(F702W or F775W) show the position of the Triplespec slit. Each square is 15 arcseconds on the side. Center: Each spectrum has been 
inverse-variance smoothed using a 5-pixel window, and the bottom panels show the 1-cr error in the same units as the flux. When multiple 
images are present on the slit, the spectrum shown is the combined spectrum. Right: The photometry from (observed) UV to infrared 
is plotted as filled points, and the color corresponds to the type of data: red for HST, purple for ground-based near-infrar ed, a nd green 
for Spitzer IRAC images. The empty points are photometric measurements not corrected for emission line flux (see Section 13.21 1, and the 
best-fit synthetic spectrum is shown as a solid line. For objects with two gravitational images, the SED is plotted only for the one that is 
less contaminated by foreground galaxies, and its magnification factor is shown. Spectra and photometry are not corrected for the lensing 
magnification. 
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A773 (z- 2.3032) 




1.22 1.23 1.24 1.60 1.61 1.62 1.63 1.64 1.65 1 
Observed Wavelength (jim) 



1-10 4 2-10 4 3*10* 4>10 4 5-10 4 

Observed Wavelength (angstrom) 



MACS0717 (z = 2.5515) 




1.32 1.33 1.72 1.73 1.74 1.75 1.76 1.77 1.78 1.79 2.33 2.34 
Observed Wavelength (jim) 



MO 4 2*1 4 3-1 4 4>10 4 

Observed Wavelength (angstrom) 



A383 (z- 2.5771) 





1.32 1.33 1.34 1.74 1.75 1.76 1.77 1.78 1.79 1.80 2.34 2.35 2.36 
Observed Wavelength (jim) 



M0 4 2-1 4 3-1 4 4>10 4 

Observed Wavelength (angstrom) 



A1689 (z- 3.0421) 




E 3-10 




1.50 1.51 1.95 1.96 1.97 1.98 1.99 2.00 2.01 2.02 2.03 
Observed Wavelength (|im) 



MO 4 2*10 4 3-1 4 4-1 4 

Observed Wavelength (angstrom) 
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A1703 (z = 3.2847) 




A 1703 



[Oil] 


[Ne III] 


H fl 


[0 III] [0 


III] 

- 










WW 


' ii ' i ' i ' 'i 

JluiJjliJUuiAL. 


' ' I I' ' ' I ' ' ' I I 1 ' I ' 

A — r |j\J\_ r ^yy\^A T J*^j\ T /v — , 


1 1 I 1 
JvVv~ 




1.62 1.64 1.66 2.08 2.10 
Observed Wavelength (urn) 



2.12 2.14 2.16 



MO" 2-1 0" 3-10* 4-1 0" 5-1 0" 

Observed Wavelength (angstrom) 



Fig. 1. — Continued. 



2.3. 



Imaging 

We now discuss the imaging data for our sources which 
will provide the essential ingredients for measuring the 
stellar masses and other physical properties. To accu- 
rately derive the stellar mass it is necessary to sample 
the spectral energy distribution (SED) redward of the 
rest-frame Balmer break. For z < 2 objects optical imag- 
ing is sufficient, but at higher redshift infrared data are 
needed. 

2.3.1. Archival Data 

Since we chose well-studied galaxy clusters, space- 
based images from Hubble Space Telescope (HST) and 
Spitzer observations are available for each target in our 
sample. 

For four of the objects (A383, MACS0717, RXJ2129, 
and A611) we used publicly available data from the Clus- 
ter Lensing and Supe rnova Survey with Hubble (CLASH, 
iPostman et al.ll2012f ). This allowed us to measure the 
photometry for each arc in about 16 bands from UV to 
near-infrared. For the remaining targets we used archival 
HST images; each source has imaging available in at least 
four bands except A1413, for which only two bands are 
available. 

All of the selected targets have publicly available 
Spitzer IRAC observations, but due to the faintness 
of the gravitational arcs not all of them are detected. 
The arcs with useful IRAC imaging are A611, RXJ2129, 
A773, and MACS0717. We used channel 1 (3.6 fim) and 
channel 2 (4.5 /jm) observations from Spitzer program 
60034 (PI: E. Egami) for all of the arcs, and data from 
program 83 (PI: G. Rieke) for A773. 

Additionally, some ground-based near-infrared data 
have been used for a few arcs : VLT ISAAC Ks-band 
for A1835 (|Richard et al.l[200l and A1689 (J. Richard 
et al., i n preparation), and Subaru MOIRCS iJ-band for 
A1703 (jRichard et al.l [2009h . 

2.3.2. Palomar Observations 

Using ground and space-based archival data we can 
probe the spectral region redward of the Balmer break 
for each arc, with the exclusion of A1413. Since this type 
of photometry is crucial for measuring the stellar mass, 
we took ifg-band imaging for A1413 using WIRC on the 
Palomar 200-inch telescope. We used a 9-point dithering 
pattern of two-minute frames for a total exposure time 
of 198 minutes. 

3. PROPERTIES OF STELLAR POPULATIONS 



3.1. Photometry 

Since the arcs experience a large gravitational magnifi- 
cation, they tend to lie at short projected distances from 
foreground galaxies. It may then be necessary to subtract 
the light of the cluster galaxies in order to reliably mea- 
sure the arc phot ometry. In these cases we used Galfit 
(|Peng et al.ll2002[ ) , which allowed us to simultaneously fit 
many galaxies while taking into account the Point Spread 
Function (PSF) of the instrument, which was measured 
from bright, isolated stars in the field. This is particu- 
larly important when working with Spitzer IRAC images, 
which present a very large and asymmetric PSF. 

Sometimes the gravitational arc emission is contami- 
nated by the cluster bright central galaxy (BCG) lumi- 
nous halo, and as a result the local background is hard 
to model. In these cases we fit the BCG using the Multi- 
Gauss ian Expansion algorithm developed by iCappellaril 
( 2002) . This method consists of fitting the surface bright- 
ness of a galaxy with a series of two-dimensional Gaus- 
sian functions, and is very effective for bright, extended 
galaxies. 

After the subtraction of the foreground galaxies, the 
arc photometry is measured using polygonal apertures. 
The major source of uncertainty is generally the mod- 
eling of foreground galaxies and, for some IRAC images 
of crowded fields, confusion. The relative error in the 
flux is ~30% in the worst cases, but typically much less 
than that. All the photometric measurements are cor- 
rected for gal a ctic e xtinction according to the map of 
iSchleeel et all (fl998l) . 

The photometric points are plotted for each arc in Fig- 
ure Q] with a different color for each set of observations: 
red for Hubble Space Telescope, purple for ground-based 
near-infrared, and green for Spitzer data. When two 
gravitational images corresponding to the same source 
are available, the photometric analysis has been carried 
out independently on the different images, and the one 
less affected by foreground contamination was selected 
for the SED fitting. In these cases the name and the 
magnification factor of the chosen image are reported in 
Figure [TJ 

The WIRC Kg-band data for A1413 are not deep 
enough to detect the faint gravitational arc, because the 
contamination from the foreground BCG galaxy's halo 
is very strong. In this case we can only derive an upper 
limit on the flux, and we show it in Figure [T] 

3.2. SED Fitting 
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For each target we fit the photometry from the ob- 
serve d UV to infrared us i ng the stellar population mod- 
els of iBruzual fc Chariot! (|2003l) in order to measure the 
stellar mass and other physical properties of the galax- 
ies. We performed t he fit using the ch i-square mini- 
mization code FAST ()Kriek et all [2009) assu ming the 
iCalzetti et al.l (|2000D dust extinction law and a lChabrierl 
(2003) initial mass function. 

Since the emission lines detected in the Triplespec spec- 
tra are extremely bright compared to the continuum, we 
subtracted the measured line fluxes (see Section from 
the appropriate photometric bands. This correction can 
be as high as 40% for a single band, but its effect on the 
stellar mass estimate is generally small because of the 
large number of photometric points used in the fit. For 
targets with multiple images on the slit the correction is 
calculated taking the flux from the combined spectrum, 
which has the advantage of a better signal to noise ratio, 
and then appropriately scaling it using the HST photom- 
etry. 

Various stellar population parameters and degenera- 
cies are involved in the process of SED fitting. One of 
the most important assumptions, and one that strongly 
affects the best-fit current SFR, is the star formation 
history. The widely used exponentially declining star 
formation history, or r-model, may not be an appro- 
priate choice since these galaxies are young and have a 
large gas reservoir. Sometimes an inverted r-model is 
used for star-forming galaxies at high redshift. Both of 
these models require strong assumptions on the current 
state of the galaxy, i.e. that its star formation is cur- 
rently at its minimum, or maximum, respectively. Also, 
they both introduce the free parameter r which is usu- 
ally not well-constrained by the data. For exponentially 
declining star formation histories, models with r < 300 
Myr can give a formally acceptable fit to the data but 
usually fail in reproducing th e SFR derived using other 
indicators ()Wuvts et al.ll201l"l ). Since star-forming galax- 
ies at high redshift are young, large values of r imply a 
nearly flat star formation history. For these reasons we 
make the simpl ifying assumpt i on of a constant star for- 
mation history. IShapley et a l. (2005b) consider a sample 
of star forming galaxies at z ~ 2 and show that the agree- 
ment between stellar masses derived assuming r-models 
or constant star formation history is very good, with no 
systematic offset and negligible dispersion. They also 
conclude that the choice of a particular SFH does not 
affect the unce rtainty in the stellar mass measurement. 

As shown by IWuvts etaTl (|2012al) . the SED fit of low- 
mass star-forming galaxies tends generally to favor ex- 
tremely young ages. Since a galaxy age cannot be smaller 
than the dynamical timesc ale, a lower limit on t he SED 
age is often set. Following iWuvts et al.1 (|2012a[ ) we use 
70 Myr as a lower limit, and the age of the Universe at 
the observed redshift as an upper limit. The effect of the 
age limit is not critical: lowering it to 20 Myr causes an 
average increase of 0.10 dex in both stellar mass and star 
formation rate. 

One of the parameters involved in the SED fitting is 
the metallicity of the stellar population. This is differ- 
ent from the gas-phase metallicity, that we measure from 
rest-frame optical emission lines (see Section 14.31) and 
that can be higher than the stellar metallicity. The al- 
lowed values of stellar metallicity for the SED fitting are 



0.2, 0.4 and 1.0 in units of solar metallicity. For some of 
the arcs, the best-fit value is significantly larger than the 
gas-phase metallicity. We attribute this unphysical result 
to the effect of SED fitting degeneracies. We performed 
a test for each arc by fitting the SED while keeping the 
metallicity fixed at the value closest to the one measured 
via emission lines. The resulting stellar population pa- 
rameters are negligibly affected, and their uncertainties 
change only marginally. 

It is important to note that among the SED fitting 
output parameter s, stellar mass is the most robust (e.g. 
IWuvts et al.l2007T ) , and is also the only one that is critical 
for our analysis. Star formation rate and dust extinction 
are more sensitive to the assumptions made, but it is pos- 
sible to compare them with independent measurements 
from the rest-frame optical emission lines. 

The best-fit spectra are shown in Figure [TJ and the 
output parameters (stellar mass, dust extinction, age, 
and current star formation rate) for each arc are listed 
in Table |4j Stellar masses and star formation rates are 
corrected for the gravitational magnification. The stel- 
lar masses are in the range 7.8 < log M/M & < 9.4, and 
are located at the low end of the mass d istribution of 
the SDSS sample (e.g. iZahid et al.ll2012af ). The uncer- 
tainties are between 0.1 and 0.3 dex except for A1413, 
for which the low number of photometric points yields 
a large uncertainty in the stellar mass, AlogM = 0.36 
dex. 

It is common practice to report the best-fit stellar mass 
(i.e. the one corresponding to the model that best de- 
scribes the photometric data) and the 68% confidence 
region. The error bars are often highly asymmetric, and 
are very difficult to propagate when using the SED re- 
sults in further analysis. In fact, a rigorous propagation 
of asymmetric error bars is possible only when the pos- 
terior distribution is known. Instead, we calculate the 
stellar mass posterior distribution from the chi-square 
grid output from FAST, and report the mean and the 
standard deviation of the distribution. The posterior 
distributions in log M are only weakly skewed and are 
well approximated by a Gaussian function. On the other 
hand, the best-fit value is often off-center, and choos- 
ing it as the best estimate would cause asymmetric er- 
ror bars. Reporting the mean and standard deviation 
has the advantage of a straightforward propagation of 
the uncertainty in following calculations, which is essen- 
tial for the present work. The same arguments apply 
to other stellar population parameters such as log SFR 
and dust extinction E(B — V), and we follow the same 
method for estimating their values. The stellar popula- 
tion age, however, presents a posterior distribution that 
is very skewed for those galaxies with a best-fit age near 
the lower limit, therefore we list the best-fit value and the 
asymmetric 68% confidence interval, which we do not use 
in any further analysis. 

4. SPECTROSCOPIC DIAGNOSTICS 

The goal of this study is to explore the relation between 
stellar mass, star formation rate and gas metallicity for 
star- forming galaxies at high redshift. In the previous 
section we derived the stellar masses using photometric 
data. In this section we use the rest-frame optical emis- 
sion lines of the gravitational arcs to measure their star 
formation rate and metallicity. 
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TABLE 3 
Emission line fluxes 



Source 


[OII]A3727, 3729 


[NeIII]A3870 


H/3 


[OIII]A4959 


[OIII]A5007 


Ho 


[NII]A6584 


A611 






15.9 ± 4.5 


34.0 ± 8.6* 


90.4 ± 4.1 


43.3 ± 6.0 


< 4.1* 


RXJ2129 






10.6 ± 5.1* 


11.8 ± 4.0* 


45.8 ± 7.3 


32.8 ± 5.9 


< 3.7 


A1413 


< 29* 


10.5 ± 3.7 


12.8 ± 3.2* 


60.8 ± 6.3* 


176 ± 3 


97.2 ± 3.7 


< 11* 


A1835 


36 ± 13 


5.8 ± 2.1 


16.6 ± 3.1 


< 14* 


44.3 ± 4.2 


45.4 ± 8.1 


5.5 ± 2.1 


A773 


23.8 ± 9.7 


< 24 


8.3 ± 3.5* 


15.3 ± 3.4 


49.0 ± 5.2 


47.2 ± 7.6 


< 9.2 


MACS0717 


19.3 ± 9.0* 


< 9.1 


8.5 ± 2.9 


4.4 ± 1.7 


18.3 ± 2.3 


18.9 ± 4.9 


< 6.0 


A383 


18.5 ± 5.9 


< 38* 


< 8.5* 


< 6.9* 


16.4 ± 2.2 


25.2 ± 7.1 


< 7.4 


A1689 


< 29* 


< 7.7* 


< 11* 


< 24* 


49.1 ± 4.9 






A1703 


12.7 ± 5.7 


5.5 ± 1.0 


11.2 ± 2.5 


34.0 ± 3.6 


81.1 ± 2.0 






Note. - 


Fluxes in units of 10 


-17 —1 
' erg s cm 


2 . The listed uncertainties apply to flux ratios, and do not include 



the error in the absolute flux calibration. For undetected lines the 2-ct upper limit is given. 
* Lines strongly contaminated by sky emission lines. These measurements are to be considered highly uncertain. 

t 1 1 1 1 1 1 1 r 



In order to derive physical quantities from the observed 
spectra, we need to quantitatively analyze the emission 
lines. Each emission line profile was fitted with one Gaus- 
sian (two for the doublet [OII]A3727, 3729). For each 
line we derived flux, redshift, width and continuum level 
from the fit. For the faintest lines we fixed one or more 
of these parameters using as a reference [OIII]A5007 or 
Ha, which have a relatively high signal to noise ratio. 
The line fluxes are reported in Table [3] The redshifts, 
measured from [OIIIJA5007, are given in Table [T] In two 
cases we found that previously published redshifts were 
incorrect (A611 and RXJ2129. iRichard et al]|2010H due 
to misidentification of rest-frame UV spectral features. 

4.1. Dust Extinction 

We estimated dust reddening using the Ha /H/3 flux ra- 
tio. Assuming a case B recombination and typical tem- 
perature (10,000 K) and density (100 cm" 3 ), the theo- 
retica l value of the ratio is 2.86 (|Osterbrock fc Ferlandl 
2006). We used the ICalzetti et all (|2000fl law to derive 
the dust extinction from the observed flux ratio. The 
results are shown in Figure [5] and compared to the dust 
extinction derived from the SED fitting. The two meth- 
ods are in good agreement and provide very low dust 
extinction for most of the arcs. Since the Balmer decre- 
ment is highly sensitive to the H/3 flux, we plot with a 
different symbol the arcs for which this line is contam- 
inated by OH sky lines. The only object with a large 
discrepancy between the two measurements of dust ex- 
tinction is A1413, for which the Balmer decrement gives 
E(B — V) ~ 0.8. We attribute this very large value to the 
effect of sky emission on the H/3 flux, since all the other 
galaxies in our sample have E(B — V) < 0.3. The SED fit 
for this galaxy, although uncertain, being based on only 
two photometric points, gives a dust extinction very sim- 
ilar to that found for the other arcs, (E(B — V)) = 0.13. 

The Balmer decrement probes the extinction in the 
HII regions, where the nebular emission originates, while 
the SED fit output applies to the overall stellar popu- 
lation of a galaxy. In principle by comparing the dust 
extinction obtained by the two methods it is possible to 
study the dust distribu tion, which can be c oncentrated 
in star-forming regions. ICalzetti etUI ()1994D found that 
for local starburst galaxies the nebular dust extinction 
is roughly twice the extinction of the stellar contin- 
uum. At z ~ 2 it is not clear whether there is a differ- 
ence between the reddening exper i enced by stellar and 
gas emission (e.g. lErb et aLll2006bt lHainline et all 120091 : 
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A611 RXJ2129 A1413 A1835 A773 MACS0717 A383 A1689 A1703 

Fig. 2. — Dust extinction E(B-V) derived from SED fitting (red 
circles), Balmer decrement (black squares), and Balmer decrement 
with H/3 affected by sky line contamination (black triangles). 



IForster Schreiber e t al. 20091) . Our data suggest a simi- 
lar amount of attenuation for the two components, but 
the H/3 flux determinations are too noisy to draw any 
conclusion. When in the following analysis we correct 
the emission line fluxes for dust extinction, we always 
use the SED fitting values, which are less affected by 
uncertainty. This method could underestimate the dust 
extinction experienced by gas emission by a factor of 2, 
which translates into an average increase in SFR of only 
0.15 dex. 

4.2. Lack of AGN Contribution 

To exclude the possibility of any AGN contribution to 
the gravitational arc emission, in Figure [3] we show the 
[OIII1A5007/H/3 vs. [NII1A6584/Ha line ratio diagram 
(BPT diagram. iBaldwin et aHll98lD . In this plot star- 
forming galaxies and AGN populate separate regions due 
to the different ionization mechanisms at the origin of the 
line emission. All the gravitational arcs lie on or near the 
star-forming branch of the diagram, and we can firmly 
exclude the presence of AGN in our sample. 

It is interesting to note that the location of these z ~ 2 
star-forming galaxies on the BPT diagram is not coin- 
cident with the region most populated by low-redshift 
galaxies. In particular, none of our objects lie in the re- 
gion log([OIII]A5007/H/3) < 0, where the majority of lo- 
cal galaxies are found. A large [OIII]A5007/ H/3 ratio has 
already been reported in many studies of high-redshift 
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TABLE 4 

Physical properties of the sample 



Source 


log(M/Af ) a E(B-V) a log(Agc/yr) a SFR SED a 


SFR Ha b 


Linewidth b 12 + log(0/H) b FMR residual c 




(M Q /yr) 


(M /yr) 


(km/s) 



A611 


8.27 


± 


0.09 


0.11 


± 


0.03 


o o+0.3 


1.7 


± 


0.5 


2.0 


± 


1.1 


27 


± 


5 


7.89 


± 


0.19 


-0.10 


± 


0.21 


RXJ2129 


7.80 


± 


0.25 


0.18 


± 


0.02 




0.9 


± 


0.5 


0.6 


± 


0.4 


39 


± 


10 


7.89 


± 


0.40 


0.05 


± 


0.43 


A1413 


8.72 


± 


0.36 


0.07 


± 


0.06 


8.oi«: 


3.1 


± 


1.7 


7.2 


± 


4.3 


30 


± 


3 


7.89 


± 


0.33 


-0.24 


± 


0.38 


A1835 


8.33 


± 


0.22 


0.10 


± 


0.07 




1.1 


± 


0.7 


1.0 


± 


0.7 


54 


± 


7 


8.45 


± 


0.07 


0.38 


± 


0.15 


A773 


9.16 


± 


0.21 


0.10 


± 


0.05 


o 7 +0.4 
8 - 7 -0.5 


1.6 


± 


2.7 


•1.4 


± 


2.7 


50 


± 


7 


8.32 


± 


0.11 


-0.08 


± 


0.17 


MACS0717 


9.36 


± 


0.19 


0.27 


± 


0.02 


7Q +o.i 
'• a -o.i 


34 


± 


15 


15 


± 


10 


65 


± 


8 


8.53 


± 


0.10 


0.11 


± 


0.16 


A383 


8.67 


± 


0.16 


0.12 


± 


0.04 




3.4 


± 


1.3 


3.8 


± 


2.3 


54 


± 


8 


8.56 


± 


0.10 


0.41 


± 


0.14 


A1689 


8.27 


± 


0.24 


0.10 


± 


0.02 


8.0l]| 


2.2 


± 


1.3 








80 


± 


9 


7.89 


± 


0.39 


-0.10 


± 


0.41 


A1703 


8.49 


± 


0.27 


0.09 


± 


0.03 


8.6«; 7 


1.0 


± 


0.5 


4.1 


± 


2.9 


34 


± 


3 


7.84 


± 


0.14 


-0.22 


± 


0.22 



Note. — Stellar masses and star formation rates are corrected for the gravitational magnification. 
a Derived from SED fitting. 

b Derived from rest-frame optical emission lines. 

c Difference between the measured metallicity and the metallicity predicted by the FMR as formulated by Mannucci et al. (2011). 



star-forming galaxies (iShaplev et all l2005at lErb et all 
2006al [2010; Mainline et all l2009t iRichard et all 120111: 
Rigbv et all 20111) . and is indicative of a high ionizatio n 
parameter, as extensively discussed by lErb et all (|2010( ). 

4.3. Metallicities 

Rest-frame optical nebular lines contain a large amount 
of information on the physical conditions of the gas re- 
sponsible for the emission, including its metallicity. If the 
auroral line [OIIIJA4363 is detected, then it is possible to 
calculate the electron temperature and have a direct mea- 
surement of the metallicity. Unfortunately this line is so 
weak that at high redshift it has been detected only for a 
handful of objects. Instead, we derived the gas metallic- 
ity from the flux ratio of strong emission lines. There are 
several well-established methods to estimate the metal- 
licity from flux ratios, calibrated using either theoreti- 
cal calculations or observations of low-redshift galaxies. 
The absolute metallicity obtained with these methods is 
highly uncertain, and the different sets of calibrations, 
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Fig. 3.— BPT diagram ( Baldwin et al. 1981]). Our sample 
(red points) is compare d to the SDSS sample (gray density map, 
Kauff mann et aL1l2003l) . Also shown are the t heoretical (dashed 
line. l KewIev et al.l200ll) and empirical (solid line, Kauffma nn et al.1 
2003) separation between active galactic nuclei and star-forming 
galaxies. 



can differ by as much as 0.7 dex ()Kewlev fc Ellisonl2008|) . 
Although this discrepancy makes it very difficult to com- 
pare observational results obtained with different cali- 
brations, relative measurements obtained with the same 
strong line method are much more reliable. 

The main goal of the present study is to test whether 
the locally determined fundamental metallicity relation 
applies to high-redshift galaxies as well. The natural 
choice is th en to use the same m etallicity calibrations 
adopted by iMannucci et al.1 ()2010[ ) in the definition of 
the local relation. Th ese are the empirical calibrations of 
iMaiolino et"all (|2008l ). which give a polynomial fit for the 
value of various nebular line ratios as a function of the gas 
metallicity. The main line ratios are [OIIIJA5007/H/3 and 
[OIII]A5007/[OII]A3727,3729, while [NII]A6584/Ha and 
[NeIII]A3870/[OII]A3727,3729 were used only for some 
arcs, mostly as upper or lower limits. All the arc metal- 
licities were calculated using at least two line ratios, 
which correspond to a minimum of three independent 
line fluxes. This minimizes the effects of catastrophic 
errors due to sky line contamination. When many line 
ratios are available for the same object, they are always 
consistent with each other, within the observational un- 
certainties. 

It is worth remarking that the ratio between the flux of 
two lines is independent of the absolute flux calibration 
even for lines that lie in very distant parts of the spec- 
trum, since Triplespec allows us to observe the J, H and 
K band simultaneously. It is also independent of grav- 
itational magnification and slit loss. We corrected the 
line fluxes for dust extinction, using the SED fitting re- 
sults, in a differential way: the ratio of two lines depends 
only on the ratio of the attenuation at the correspond- 
ing wavelengths. This results in a small correction to 
the metallicity estimate and its uncertainty. The dust- 
corrected flux for Ha was also used as a proxy for H/3 
(assuming Ha/H/3 = 2.86) in order to have an additional 
constraint. 

The relation between [OIII]A5007/H/3 and the metal- 
licity is not monotonic, and presents a maximum at 
12 + log(0/H) = 7.89. Since this is a stationary point, 
any uncertainty in the line ratio is transformed into a 
much larger uncertainty in the metallicity. About half 
of our sample is found in this location, with large un- 



10 



Belli et al. 




0.1 1.0 10.0 100.0 

HctSFR (M s /yr) 



Fig. 4. — Comparison of star formation rates derived using two 
methods: SED-fitting and Ha flux. The empty circle is A1703 for 
which H/3 has been used as a proxy for Ha. 

certainties on log(0/H), up to 0.4 dex. The shape of 
this calibration also causes the existence of two possible 
metallicity values in some cases, but an unambiguous so- 
lution is always found thanks to the other line ratios. 

The final gas metallicity for each arc is taken to be the 
weighted average of the results from all the available line 
ratios and is given in Table [4] 

The metallicity of the two gravitation al arcs A1689 
and A 1835 has already been measured bv lRichard et al.l 
(2011) from near-infrared spectra obtai ned with Keck 
NIRSP EC and using the same set of Maiolino et al. 
(2008) calibrations. Our results are in good agreement. 

4.4. Star Formation Rates 

We derived the current star formation rate from the 
extinction-co rrected Ha emiss ion flux using the calibra- 
tion given bv lKennicuttl (j!998l ). dividing the result by 1.7 
to convert to that appropriate for a Chabrier IMF. The 
resulting star formation rates, corrected for the gravita- 
tional magnification, are reported in Table |H 

The SFR derived from nebular emission accounts for 
the star formation activity in the physical region of the 
arc that is covered by the slit, which is different, in prin- 
ciple, from the star formation rate of the entire galaxy. 
But the narrow gravitational arcs from our sample are 
easily covered by the 1 arcsec wide slit, as is apparent 
from the image stamps in Figure [TJ Therefore we do not 
attempt to correct for this effect, which is in any case less 
important than the uncertainty caused by slit alignment 
and seeing variability. 

Figure [4] shows excellent agreement between the star 
formation rates calculated using SED fitting and Ha flux. 
This is encouraging because it validates the numerous 
assumptions made in the derivation of the star formation 
rates. It is particularly interesting that the agreement 
between SED-fitting and nebular emission even holds for 
A1413, where only two photometric points are available. 
It is possible that the simplifying choice of a constant 
star formation history, with the consequent decrease in 
the number of free parameters, helped reduce the scatter 
in the comparison between the two methods. 

In the following section we will always use the star 
formation rate derived from the Ha flux. The spectra 
of the two objects at z > 3 do not include Ha, which is 



rcdshifted outside the Triplespec range. For one of them 
(A1703, empty circle in Figure |4j we use the observed 
flux of H/3 as a proxy for Ha, assuming the theoretical 
line ratio discussed in Section [4. II and correcting for dust 
extinction. For A1689, for which both Ha and H/3 are 
not available, we use the SED fitting star formation rate. 

4.5. Line Widths 

The broadening of the emission lines due to the gas 
kinematics depends on the gravitational well of the galax- 
ies. Measuring the line widths can then give an estimate 
on the gravitational arc masses that is independent of 
SED fitting and gravitational lensing models. 

The observed velocity width of the nebular lines need 
to be corrected for the instrumental resolution, which was 
measured from the sky OH lines. For each source we ex- 
tracted the spectrum of the sky using the same procedure 
followed to extract the arc spectrum. We measured the 
dispersion of the brightest, unblended OH lines, which 
is 40-55 km s _1 depending on spectral order and wave- 
length. We calculated a linear fit of the ratio of the 
spectral resolution R to the order to as a function of 
wavelength, and used this to estimate the instrumental 
resolution for each nebular line. 

Most of the arc emission lines are well-resolved. For 
each source we take the weighted mean of the line widths 
of all the well-detected lines excluding [OII]A3727, 3729, 
which is a doublet and is not completely resolved. The 
results are listed in Table SJ 

Since a detailed lensing map is needed to measure the 
intrinsic radius of the gravitational arcs, we do not at- 
tempt to estimate the dynamical masses. The observed 
velocity dispersions, however, are unus ually low if com- 
pared to the results of similar studies (lLaw et al.1 [20091 : 
iForster Schreiber et al.ll2009t: Uones et al.ll2010D . This is 
an important confirmation of the low masses found in 
our sample. 

5. RESULTS 

In this section we combine our measurements of stellar 
mass, gas metallicity and star formation rate to explore 
the properties of our sample of low-mass galaxies. In 
particular, the goal of this work is to test whether high- 
rcdshift galaxies follow the local fundamental metallic- 
ity relation, claimed t o be valid up to at least z ~ 2.2 
(|Mannucci et al.ll2010D . 

5.1. The Mass-Metallicity Relation 

The mass-metallicity relation for our sample is plotted 
in Figure [5] togeth e r with the fit to the local relation from 
i Kewlev fc Ellison (|2008l ) and to the high-redshift one of 
lErb et all (|2006a[ h All the results shown in this figure 
have been derived using the same set of metallicity cal- 
ibrat ions (the fits shown are taken from Maiolin oet al.l 
2008:|, and are corrected for the choice of IMF) . 

The magnification caused by gravitational lensing al- 
lows us to probe stellar masses much smaller than those 
considered in previous studies of unlensed galaxies, even 
in the nearby Universe. This makes a direct comparison 
difficult, but it is clear from Figure [5] that our points do 
not lie on the extrapolation of neither the z ~ nor the 
z ~ 2 relations, and have a substantial scatter, larger 
than the observational uncertainties. 
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Fig. 5. — Mass-metallicity relation for our sa mple (red points). 
The fi t to the relation at z ~ ( black solid line, I Kcwlcy & Ellison 
l200gh and z ~ 2.2 (blue solid line. lErb et al.l2006all are also s hown . 
Note that these fits are calculated using the Maiolino ct al. (2008) 
metallicity calibrations. The dashed lines are extrapolation at low 
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Fig. 6. — Star formation rate versus stellar mass diagram. The 
main sequence of st ar-forming galaxies is s hown in black at z ~ 
flZahid et al.M2012bl) and at high redshift HWhitaker et al.ll20T!3) , 
with dotted lines being the extrapolation at masses below the com- 
pleteness limit. Our sample (red points) is roughly on the main se- 
quence at z ~ 2. Poi nts from other studies of lensed ga laxies at high 
redshift are shown: Hichard et al. QOllT) in orange. IWuyts et alj 



ruici 

(2012b) in gray, and Christcnscn ct al. (2012) in blue. 



I Yuan et al.l {2013) measured the mass-metallicity rela- 
tion for a sample of gravitational arcs at z ~ 2 obtained 
by combining their data with results from previous stud- 
ies. Although a comparison of the absolute measure- 
ments is not possible because they use a different metal- 
licity calibration, their results are similar to ours: the 
lensed galaxies tend to have lower metallicities than the 
SDSS galaxies but do not lie on a tight sequence. 

This results are consistent with the hypothesis of a 
mass-metallicity relation dependent on some other pa- 
rameter that is not necessarily the redshift. In the re- 
maining parts of this section we will explore the role of 
the star formation rate. 

5.2. Star Formation Rate versus Stellar Mass 



In Figure [S] we show the location of our sample in the 
SFR-stellar mass diagram (red points), compared to the 
results of other studies of lensed galaxies at hig h redshift: 
IRichard et ail (120111. in orange) .TWuvts et all (|2012bl in 
gray). and lChristensen et al.l ( 20121 in blue). Our sample 
populates the lower left corner, with masses and star 
formation rates on average lower than what probed by 
previous studies. In particular, we more than doubled 
the number of low-mass galaxies (M < 10 9 M Q ) at this 
redshift with known metallicity and SFR. 

In the mass-SFR plane, star-forming galaxies lie on a 
relatively tight relat ion often called the main sequence 
(jNoeske et al.l [2001 . This sequence evolves strongly 
with redshift, with the normalizati on decreasing over cos- 
mic time at least since z ~ 2.5 dWhitaker et all l20"i~2T; 
IZahid et"a l. 2012b, black lines in Figure|Hl). The gravita- 
tional arcs that we selected have star formation rates that 
are on or below the main sequence at z ~ 2. Previous 
studies of lensed galaxies did not reach such low values of 
SFR, with the exception of the work of lChristensen et al.l 
(l20ll . 

The very low star formation rate of these arcs, be- 
tween 0.6 and 15 M /yr, is of fundamental importance 
in this study. First, the fact that our sample lies on the 
main sequence means that these galaxies are represen- 
tative of the typical population of star-forming galaxies. 
Shallower studies are biased towards luminous galaxies 
with star formation rates much higher than the main 
sequence. These objects are thought to be in a star- 
burst phase, potentially caused by a merger, and are not 
representative of the typical conditions of star-forming 
galaxies. Secondly, it allows us to compare high and 
low redshift galaxies with similar star formation rates, 
and thereby directly address the goals of this paper. 
From Figure |6] we can see that local massive galaxies 
(9.5 < log M/Mq < 11) that lie on the main sequence at 
z ~ have SFRs comparable to our sample. This is not 
the case for the majority of lensed galaxies considered by 
previous studies. 

5.3. The Fundamental Metallicity Relation 

We now turn our attention to the fundamental metal- 
licity relation, a surface in the 3D parameter space of 
stellar mass, metallicity and sta r formation rate tightl y 
followed by the SDSS galaxies (jMannucci et all 12010( 1 . 
In Figure [7] we plot for each gravitational arc the dif- 
ference between the metallicity that we measure from 
nebular lines and the metallicity predicted by the local 
FMR given its stellar mass and star formation rate. We 
also show the points from the previous studies of lensed 
galaxies. Clearly there is a correlation between metallic- 
ity residuals and stellar mass: at low masses the FMR un- 
derestimates the metallicity. The mean metallicity resid- 
ual for our sample is 0.37 ± 0.09 dex. 

At such low masses, even the SDSS sample has few 
objects. In fact we are using here an extrapolation of 
the fundamental metallicity relation, which was origi- 
nally found for galax ies more massive than 10 9 ' 2 Mq. 
iMannucci et al.l (|201lD extended the local FMR down to 
10 8 - 3 Mq, finding a slightly different slope and a larger 
scatter. With this version of the fundamental metal- 
licity relation the residuals, shown in Figure [5J have a 
much smaller dispersion. The weighted average of the 
low- mass FMR residuals is 0.12 ± 0.07. However, the 
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Fig. 7. — Difference between measured metallicit y and the predic- 
tion o f the fundamentai metallicity relation from Mannucci ct al. 
pOlOh . The points are color-coded as in Figure [6] 

weighted mean is skewed toward galaxies with higher 
metallicity since they are measured with higher preci- 
sion, because the metallicity calibrations are not linear. 
This is clear from Figure [SJ where the group of five galax- 
ies aligned at 12 + log(0/H) ~ 7.9, the maximum of the 
[OIII]A5007/H/3 line ratio calibration, present the largest 
error bars. The arithmetic mean is not affected by this 
bias, and gives (Alog(0/H)) = 0.02 ± 0.08. The agree- 
ment of high-redshift lensed galaxies with the local fun- 
damental metallicity relation is remarkable, and strongly 
suggests that these objects lie on the relation indepen- 
dently of their redshift at least up to z ~ 3. Previous 
studies of lensed galaxies also found a general agreement, 
but without conclusive results for M < 10 9 M Q , perhaps 
because of a less strict selection in SFR. Also, this is the 
first time that the universality of the FMR is confirmed 
using galaxies at high redshift with a SFR which is ob- 
served in typical galaxies in the local Universe. 

From Figure [5] it is clear that our gravitational arcs 
show a relatively small scatter around the local funda- 
mental metallicity relation. The standard deviation of 
the metallicity offsets from the FMR is 0.24 dex, and 
the mean error in Alog(0/H) is 0.25 dex. Therefore 
the observed scatter could be in principle just a product 
of observational un certainties. In contrast , the standard 
deviation found by Mannuc ci et al.l (|2011[) for the SDSS 
sample is about 0.4 dex at 1O 8 ' 4 M , and is shown in 
gray in Figure [5] as a function of stellar mass. Although 
they do not report the typical errors on mass, star for- 
mation rate and metallicity, they claim that the observa- 
tional uncertainties are not large enough to explain the 
observed dispersion. Furthermore, SDSS galaxies are se- 
lected by requiring a signal to noise ratio greater than 
25 for the Ha flux, therefore the uncertainty in their 
metallicity must be much smaller than for our sample. 
Since the uncertainty in A log(0/H) is mainly due to the 
metallicity measurement, we conclude that the intrinsic 
scatter at z ^ 2 is most likely smaller than at z = 0. 
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Fig. 8. — Difference between measured metallici ty and the predic - 
tion of the FMR extension to low masses ( M annucci et al.ll20Tll '). 
Our points are shown in red (with A611 offset by 0.05 dex in mass 
for clarity). The standard deviation of the SDSS low-mass galaxies 
is shown in gray. 

6. DISCUSSION 

Our data confirm that the fundamental metallicity re- 
lation applies to low-mass galaxies at 1.5 < z < 3. This 
suggests that this relation is time-invariant and therefore 
universal. 

In the equilibrium model of iFinlator fc Pavel ([2008), 
metallicity and star formation rates are tightly con- 
nected to gas inflows and outflows, so that a change in 
one implies a consequen t change in the other (see also 
iDave et al.|[20TTal |20I3 ). If each of these processes arc 
in equilibrium, then the FMR is naturally explained. If a 
galaxy is perturbed, by e.g. a minor merger, after a cer- 
tain time it will return to the equilibrium configuration. 
The observed evolution in the mass-metallicity relation 
could be due to the fact that we are sampling galaxy 
populations with different star formation rates at differ- 
ent redshifts. This would explain why our points do not 
lie on the l ow-mass end of th e z ~ 2 mass-metallicity re- 
lation from lErb et"all (|2006al ). that was determined using 
relatively high-SFR galaxies. 

The low scatter that we found at high redshift may 
provide a valuable additional constraint for numerical 
or analytical models of galaxy evolution. This is par- 
ticularly important be cause at low masse s models have 
diverging predictions ( Za hid et al.ll2012a[) . In the equi- 
librium model, the observed scatter of the FMR is de- 
termined by how quickly a perturbed galaxy can return 
to equilibrium. This timescale, in turn, depends on the 
mass loading factor, a parameter that is fundamental for 
hydrodynamic simulations. The observation of the scat- 
ter in the FMR at different stellar masses and redshifts 
therefore gives important constraints on numerical sim- 
ulations of galaxy evolution. But current si mulations do 
not resolve stellar masses below 10 9 M Q (e.g. |D ave et al] 
1201 lM) . 

An interesting perspective on t he issue of the FMR 
scatter has been pointed out by I Zahid et"aLl (|2012al) . 
They studied the mass-metallicity relation at z ~ us- 
ing various samples from the literature and found a clear 
increase in the intrinsic scatter at low stellar masses. 
They suggest the possibility that this scatter is due to 
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a population of low-mass, metal-rich galaxies which are 
near the end of their star formation. At a fixed mass, 
the same amount of metals would give a higher metallic- 
ity measurement, since there is little gas left. Assuming 
that the scatter in the FMR is directly caused by the 
scatter in the mass-metallicity relation, our observations 
agree well with this scenario, since at high redshift such 
a population of low-mass galaxies terminating their star- 
formation would not be expected. 

7. SUMMARY 

We present near-infrared spectroscopic data for 9 grav- 
itational arcs between redshift 1.5 and 3.3, and the mea- 
surement of their stellar mass, gas metallicity and star 
formation rate. The use of strong gravitational lensing 
allows us to probe very low masses and star formation 
rates. Our sample more than doubles the number of 
galaxies with stellar masses below 10 9 M Q at z ~ 2 with 
known metallicity and SFR. Our main goal is to test 
whether these galaxies follow the fundamental metallic- 
ity relation discovered for local galaxies. 



We find that the gravitational arcs lie above the mass- 
metallicity relation at z ~ 2 but below the local relation. 
However, they also have SFRs that are roughly on the 
main sequence of star-forming galaxies. This means that 
they are representative of typical star-forming galaxies, 
i.e. they are not in a starburst phase. 

Our data are fully consistent with the low mass 
extension of the local fundamental metallicity relation 
(M annucci et~^[20Tl . with a mean metallicity offset of 
0.02 ± 0.08 dex. The dispersion around the FMR of 0.24 
dex is smaller than the one measured for local galaxies, 
and represents an important additional constraint for 
galaxy evolution models. 

We acknowledge Kevin Bundy for providing the WIRC 
reduction pipeline, Eiichi Egami for the IRAC mosaic im- 
ages, and Marceau Limousin and Drew Newman for some 
of the magnification factors. We thank Drew Newman 
and Gwen Rudie for useful discussions. RSE and SB are 
supported for this work via NSF grant 0909159. JR is 
supported by the Marie Curie Career Integration Grant 
294074. 
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